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ABSTRACT 


In  order  to  be  cost-effective,  difzital  circuits 
implemented  in  MOS  larpe  scale  intepration (LSI ) are  desipned 
with  close  regard  to  optimality.  Three  criteria  - power 
dissipation,  chip  area(cost),  and  speed  - are  balanced  to 
produce  a final  circuit.  Current  practices  involve  a 
significant  amount  of  simulation  to  determine  the  most 
satisfactory  trade-offs.  This  project  is  concerned  with  the 
prediction  of  optimal  values  of  certain  design  parameters 
for  a given  function  of  the  above  three  criteria. 

This  paper  first  develops  a simplified  model  for 
MOS/LSI  circuits.  Using  this  model,  equations  are  derived 
for  circuit  speed,  power  dissipation,  and  area.  These 
equations  are  expressed  as  functions  of  the  channe] 
dimensions  of  the  MOS  transistors  and  of  the  degree  of 
pipelining  of  the  circuit. 

These  three  equations,  and  their  pairwise  products,  are 
then  optimized.  The  relationships  between  the  eouations  and 
the  parameters  are  examined.  The  effects  and  trade-offs  of 
particular  design  choices  are  described. 
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single-channel  (P  or  N)  devices. 


Once  the  decision  to  iTplenent  a given  circuit  in 


MOS/LSI  is  made,  the  circuit  designer  has  Tany  decisions  to 


make.  For  some  systems,  the  logic  ,;ill  not  fit  economically 


onto  one  integrated  circuit  (IC)  and  must  be  partitioned 


appropriately.  When  the  circuit  designer  is  finally  readv 


to  implement  a given  logic  circuit  within  an  TC,  there  are 


still  many  parameters  with  which  to  he  concerned.  These 


fall  into  three  major  categories  - 1)  oarameters  related  to 


the  processing  to  be  used,  2)  parameters  related  to  the 


intended  operating  environment  of  the  TC,  and  3)  parameters 


over  v/hich  the  designer  has  direct  control 


Process  parameters  dictate  to  tho  designer  the  minima 


and  maxima  imposed  to  achieve  respectable  yields  or  the  IC 


production  line.  These  include  such  things  as  minimum  metal 


line  widths  and  minimum  transi stor-to-  transistor  spacing, 


Fnvironmental  parameters  dictate  the  required  interface 


from  the  IC  to  the  outside  world.  They  include  temperature 


ranges,  and  signtol  requirements  for  both  input  and  output 


Power  supply  voltages  are  often  .specified,  although  the 


circuit  designer  may  have  some  di«!c»'eti  op  in  order  to  meet 


oerformancp  criteria.  Process  and  environmental  nararetee?! 


are  constraints  and  bounds  from  the  circuit  d^sl '•’nor ' .s  noin^’ 


of  view. 
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The  remaining  parameters  are  the  orimary  tools  of  the 
circuit  designer.  Thus,  the  number  of  parameters  which  are 
totally  at  the  designer's  disposal  is  quite  small.  In 
MOS/LSI,  they  are  effectively  only  the  channel  dimensions  of 
the  MOS  transistors.  These  directly  affect  circuit  speed, 
power,  and  area,  which  is  related  to  circuit  cost. 

All  of  the  above  parameters  are  utilized  to  achieve 
some  design  goal  - minimum,  cost,  maximum  speed,  minimum 
power  dissipation,  or  some  combination  of  these.  This  paner 
deals  with  the  relationships  between  the  various  eoals  as 
evidenced  by  their  dependence  on  the  channel  dimensions  of 
the  transistors.  In  addition,  the  effect  of  pipelininp  is 
examined  as  an  additional  tool  of  the  circuit  designer. 

As  the  semiconductor  industry  considers  much  of  its 
data  proprietary,  raw  numbers  are  not  available.  It  is  the 
intent  of  this  paper  to  provide  a first  order,  or 
approximate,  characterization  of  MOS/LSI.  Thus,  the  results 
will  include  coefficients  which  are  unevaluated.  Comments 
and  conclusions  on  the  form  of  these  results  are  presented, 
including  the  trade-offs  implied.  Readers  with  access  to 
the  actual  figures  in  the  industry  should  be  able  to 
calculate  values  for  the  coefficients  and  obtain  first  order 
numerical  results. 

The  logic  design  of  the  circuit  to  be  implemented  is 
considered  to  be  frozen,  as  the  optimality  of  a logic  design 
is  not  within  the  scope  of  this  paper.  Given  a particular 


lopic  circuit,  then,  how  con  a circuit  designer  choose 


appropriate  channel  dimensions  and,  possibly,  the  degree  o 


pipelining  desired?  In  the  remainder  of  this  chaper 


necessary  background  in  MOS  digital  devices  is  provided 


Theses  topics  include  MOS  transistor  operation,  MOS  digital 


inverters,  the  Peta  ratio  concept,  'Uiltinle  input  digital 


gates,  and  layout  considerations 


Then,  in  Chapter  2,  a simplified  model  of  MOS  digital 


devices  is  developed.  The  model  is  first  used  to  describe  a 


single  MOS  inverter.  The  speed,  area,  and  power  dissipation 


are  then  specified  for  the  model.  Next,  arguments  are 


presented  to  extend  the  model  to  represent  more  complex 


circuits,  up  to  an  entire  integrated  circuit.  ^'odels  for 


storage  devices  are  developed,  which  also  model  the  degree 


of  Pipelining  of  a circuit 


Chapter  3 uses  the  model  developed  in  the  proceeding 


chapter  to  ascertain  optima  for  the  three  operating 


parameters  - speed,  power 


individually.  With 


area 


these  in  mind 


the  Beta  ratio  constraints  are  used  to 


simplify  the  model  further 


Chapter  4 presents  more  complex  optima,  those  involving 


tv;o  of  the  operating  parameters  at  a time.  <\mong  these  are 


speed-pov;er  product,  speed-area  product 


power-area 


product.  Also,  optimum  power  dissipation  and  optimum  area 


are  each  examined  assuming  a bound  on  the  speed 


Chapter  5 presents  two  numerical  examples  to 


demonstrate  the  methods  developed. 

Chapter  6 is  a short  conclusion. 


1.2  Operation  Of  MOS  Transistors 

For  our  purposes,  a detailed  description  of  the 

operation  of  the  MOS  transistor  is  not  needed.  The  model 

used  here  is  a simple  one,  and  details  may  be  obtained  from 

1 p o 

the  references. 

The  MOS  transistor  may  be  thought  of  as  a voltage  in, 
current  out,  three  terminal  device  (Fig.  1).  The 

transistor  gate  appears  as  an  extremely  high  resistance,  and 
may  be  modeled  as  a capacitor (There  is  a certain  ambiguity 
associated  with  the  word  "gate”.  In  this  paper,  "transistor 
gate”  refers  to  the  terminal  of  a MOS  transistor,  and 

"digital  gate"  or  "logic  gate"  refer  to  a combinatj onal 
logic  gate.).  The  source  and  drain  are  identical,  but  tbp 
one  whose  voltage  is  most  negative  is  labeled  as  the  source 
(in  N-channel).  A voltage  applied  to  the  transistor  gate 
must  exceed  the  source  by  a threshold  voltage,  V^,  or  more 
to  turn  the  device  on.  If  Vpg,  the  gate-to-source  voltage, 
is  less  than  V^,  the  device  is  off  and  the  drain  and  source 
are  electrically  isolated  (no  current).  On  the  other  hand, 
the  more  Vgg  exceeds  V^.,  the  lower  the  impedance  between 
drain  and  source.  It  is  useful  to  define  the  turn-on 


FP-5418 


Figure  1. 


Symbolic  lepresentation  of  MOS  Transistor 
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voltage,  as  the  amount  V^g  exceeds  the  threshold,  V^.  Thus, 
a negative  implies  that  the  device  is  off,  and  the 
value  of  when  positive,  is  a measure  of  how  bard  the 
device  is  turned  on. 

The  transistor's  operating  characteristics,  when  on, 
may  be  m.odeled  in  two  regions,  saturation  and 
non-saturation.  The  boundary  between  these  regions  occurs 
when 


ds  ’ 


(1-1) 


where  is  the  drain-to-source  voltage. 

The  regipn  known  as  saturation  occurs  when 
while  V^Qp  > Vjg  implies  non-saturation.  The  drain  current 
(=the  source  current)  is 


^d  “ —''ton 


(1-2) 


for  saturation  and 


!< ' ^ ( 2V^QpV^g  - V(^g  ) 


(1-3) 


for  non-saturation.^  Here  K’  is  a constant  coefficient  and  W 
and  L are  the  channel  dimensions  ( width , 1 ength ) of  the 
device . 


For  transistors  which  are  enhancement  mode,  the 
threshold  voltage,  V^,  is  a positive  non-zero  ouantity.  ?^o, 

Vton- 


1 * 


8 


ton. 


- V, 


For  depletion  mode  devices,  the  threshold  voltnpe  is 
nctually  negative,  so  a of  zero  is  already  in  the  on 

region.  It  is  customary  to  define 


Vp  = - V,  (1-5) 

for  depletion  mode  transistors . V , the  pinch-off  voltage, 

r ' 

represents  the  amount  of  voltage  which  is  turning  the  device 
on  when  the  gate-to-sour  ce  voltage,  is  zero.  So  for 

depletion  mode  transistors: 


ton. 


gs 


(1-f  ) 


A zero  Vp.g  results  in  a positive  non-zero  so  the  device 
is  normally  on  (enhancement  devices  ,sre  normally  off).  It 
would  be  necessary  to  apply  a negative  eaual  in 
magnitude  to  V,^  to  turn  it  off.  However,  this  is  not 
normally  done.  The  depletion  node  transistor  as  used  ir 
this  paper  has  = 0,  so  it  never  turns  off.  It  may  then 
be  used  as  a resistive  device,  whose  resistance  varies  with 
its  drain-to-source  voltage,  V^g. 


1.3  Digital  MOS  Oates 

Each  MOS  digital  gate,  usine  ratio  loeic,  consiats  of  a 
single  depletion  mode  translator,  called  the  load 
transistor,  and  one  or  more  enhancen-^nt  mode-  transistors. 


« -H  -f';  V ' 
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called  the  driver  transistors.  Ratio  lopic  refers  to  the 
fact  that  a logical  zero  is  the  result  of  a voltage  divider 
consisting  of  the  load  transistor  and  the  driver 
transistors.  The  simplest  conf ipuration , an  inverter,  has 
just  one  driver  transistor  and  is  shown  in  Fig.  2. 

1.3.1  Inverters  - 

The  load  transistor,  since  it  is  depletion  mode,  always 
remains  on.  The  driver,  though,  may  be  turned  on  or  off  by 
its  gate  voltage,  Actually,  the  transistors  are  all 
continuous  devices.  For  a given  may  be  determined 
by  setting  the  two  currents  equal  (taking  into  account 
the  saturation  or  non-saturation  of  each  device).^ 

A more  intuitive  approach  Is  taken  in  this  paper.  The 
dr ain-to-sour ce  impedance  of  the  depletion-mode  load  device 
can  be  viewed  as  a somewhat  constant  resistance,  certainly 
within  an  order  of  magnitude  or  so.  On  the  other  hand,  as 
V^P  goes  from  zero  to  ■►V,  the  drain-to-source  im.pedance  of 
the  driver  will  decrease  many  orders  of  magnitude.  The  two 
devices,  then,  may  be  viev/ed  as  a voltage  divider  to 
determine 


i 

i 


11 


1.3.2  Beta  Ratio  - 

A useful  tool  for  denoting  the  relationship  between  the 
two  resistances  of  ratio  lorric  (load  and  driver  sections)  is 
the  Beta  ratio.  It  describes  the  static  or  steadv-state 
operation  of  a digital  gate.  Beta  ratio  is  defined  as 

( -j—  )dri  ver 

Beta  = — (1-7) 

(— ^ ) load 


^'d^l 

W^Ld 


(1-H) 


Since  W/L  appears  explicitly  in  the  current  eciuation  for 
each  device,  Beta  measures  the  "strength"  of  the  driver 
transistor  with  respect  to  the  load  transistor.  For 
instance, for  small  values  of  Beta,  the  "resistance"  of  the 
driver  may  never  become  as  small  as  that  of  the  load.  In 
this  case,  the  output  would  remain  near  +V  for  any  value  of 
between  zero  and  +V.  Alternatively,  if  Beta  is  large,  a 
very  slight  increase  in  just  above  V^,  would  cause  ^out 

to  change  from  near  +V  to  near  ground.  Here,  the 

"resistance"  of  the  driver  becomes  very  much  less  than  that 
of  the  load  with  only  a very  small  value  for  These 

results  are  graphically  depicted  in  Fie.  3.  Bote  that  ^^out 
never  changes  until  Vj^p  exceeds  V^.  This  occt.'rs  because 
V^^P  for  the  driver  is  negative  for  V^p  < V^,  makinc  the 
driver  off  and  its  "resistance"  practically  infinite. 


< 

^ ; 
i 1 


(volts) 


Vin  (volts) 


Figure  3.  Inverter  Transfer  Characteristic  A,s  \ Function 
of  Beta-ratio 
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The  net  impact  of  Beta  ratio  is  a bound  on  the 
relationships  of  the  channel  dimensions  of  driver 
transistors  relative  to  their  load  transistors.  If  the  V.'/L 
for  the  driver  section  is  too  small  relative  to  the  WL  for 
the  load,  the  logic  gate  will  be  unable  to  achieve  a 
satisfactorily  low  voltage  or  zero  output  (even  in  the 
steady-state  or  D.C.  condition).  Since  there  are  physical 
minima  for  channel  dimensions  imposed  bv  the  production 
process,  Beta  may  be  increased  beyond  a certain  amount  only 
by  increasing  or  above  their  minima.  This  increase 
obviously  increases  the  area  of  a logic  gate,  so 
unreasonably  high  Beta  values  are  to  he  avoided  as  well. 


1.3.3  Multiple  Input  Gates  - 

More  complex  logic  gates  than  the  inverter  are  achieved 
by  making  series-parallel  combinations  of  driver 

transistors.  Examples  of  possible  configurations  are  shown 
in  Fig.  4.  The  circuit  in  Fig.  4a  represents  the  NAMD 

function  for  positive  logic  while  Fig.  4b  is  a NOR  circuit. 
Fig.  4c  realizes 

Output  = ((  In-|AIn2  )V  lnj4  ) . (1-9) 

In  other  words,  4c  is  a three-input  NOR  logic  gate  in  whic*^ 
one  of  the  inputs  is  actually  the  AND  of  two  inputs. 
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These  and  other  similar  configurations  comprise  the 
usual  combinational  logic  gates  in  ratio  logic.  Although 
bridges  and  other  more  exotic  configurations  are  possible, 
their  occurence  is  assumed  to  be  infrequent. 

1.3.4  Layout  - 


The 

physical 

layout 

of  the 

digital 

pates  is  an 

important 

topic , 

as  it 

r e ].  a t e s 

to  the 

f.  7 

area . ' ’ The 

transistors  are  basically 

surface 

devices. 

so  the  logic 

gates  and  circuits  may  be  described  in  two  dimensions. 

I Furthermore,  a circuit  is  usually  laid  out  in  a bus-oriented 

fashion  to  facilitate  distribution  of  power,  ground,  and 
logic  signals.  A representative  .section  of  logic  is 
pictured  in  Fig.  5.  In  this  figure,  four  inputs  enter  the 
section  of  logic  on  the  left,  while  four  outnuts  and  two 
inputs  exit  on  the  right,  for  use  in  succeeding  logic.  The 
, *1  dashed  regions  represent  the  areas  occupied  by  the  load  and 

driver  sections  of  each  digital  gate,  while  an  X denotes  a 
connection  between  an  interconnect  line  and  a digital  gate. 
As  can  be  seen, much  of  the  area  is  occupied  by  the  businm  of 
logic  signals,  power,  and  ground  to  appropriate  Ionic  pates. 
The  remainder  is  occupied  by  the  actual  transistors  and 
interconnections  of  each  iogic  pate. 

■| 
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2.  CHARACTFRIZATIOM  OF  oT’-'FLIFIED  FOS  WODEL 


In  this  chapter  the  sirTplified  description  of  FOS 
devices  is  used  to  develop  a set  of  equations  which  describe 
three  operating  functions  of  the  channel  dirrension 
parameters:  device  speed,  circuit  area,  and  device  nower 
dissipation.  These  functions  are  first  devised  for  a single 
FOS  inverter  and  then  extended  to  model  more  comnlex 
devices.  finally,  the  character  i7,ati  on  of  storage  elements, 
or  registers,  is  introduced  and  the  eouations  are  altered  to 
reflect  the  effects  of  pipelining. 

2.1  Character ization  Of  Inverters 

2.1.1  Inverter  Speed  - 

The  speed  of  r^m iconductor  devices  is  renerally 
nresented  by  describing  the  inverse  of  sneed,  time,  as  in 
the  speed-povfer  nroduct  which  has  t''«  units  of  watt-sec  (not 
watt/sec).  Therefore,  this  naner  uses  a 


function  related  to 
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device  delay  time  to  characterize  device  speed. 


( ' 
! ■' 


The  speed  of  an  MOS  inverter  could  he  represented  by 
the  rise  time  or  by  the  fall  time  of  the  device.  The  value 
chosen  here  is  the  sum  of  these  two: 

T = T^  + T^  . (2-1) 

This  choice  is  made  since  the  values  of  rise  and  fall  times 
are  usually  quite  different,  so  neither  alone  is  an  accurate 
measure.  Furthermore,  all  MOS  lorric  ?ates  have  a sinple 
inversion  embedded  in  them.  Thus  a sinn?*l  which  oronagates 
through  two  or  more  levels  of  logic  has  both  rising  and 
falling  delays. 


The  rise  time  of  a digital  inverter  is  dependent  only 
on  the  load  transistor,  since  the  driver  transistor  has  .iust 
been  turned  off  (The  time  to  turn  the  driver  off  is  included 
in  the  time  to  turn  the  previous  digital  gate  on.).  Given 
that  the  transistor  is  driving  a purely  capacitive  load,  the 
rise  time  is  inversely  proportional  to  the  drain  current: 

T„  < 4-  • (2-2) 

j. 

'Uso,  from  equations  1-2  and  1-3, 

W, 

II  <<  -r^  . (2-2) 

‘'1 


Therefore , 
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(2-^0 


or 


■i-«7 


(2-5) 


where  k-|  is  a constant  coefficient. 

The  fall  time  is  more  complicated  since  both  load  and 
driver  transistors  are  on.  This  situation  may  be  modeled  as 
in  Fig.  6.  This  means  that 


out 


- ( Id  - Ii  ) 


(2-6) 


;'v! 

I-  ■' 
I . 

^ i 


and  that  the  fall  time  is  inversely  proportional  to  the 
difference  of  the  two  device  currents.  For  the  device 
output  to  fall  at  all,  must  be  greater  in  magnitude  than 
Ii.  The  steady-state  requirements  mentioned  in  the  Beta 


purposes , 


out 


> such  that 

Id 

must  be 

in  order 

to 

obtain  a 

Thus,  for 

all 

practical 

Id  • 

(2-7) 
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(2-9) 


With  these  equations,  the  total  time  is  represented  by 


T 


♦ T, 


(2-10) 


2.1.2  Inverter  Power  - 

The  power  dissipation  of  an  WOS  inverter  is  relatively 
simple  to  evaluate.  When  the  driver  is  off,  there  is  a 
virtual  open  circuit  between  +V  and  pround,  so  no  power  is 
dissipated  in  the  static  case.  On  the  other  hand,  when  the 
driver  is  on,  the  output  voltape  is  near  pround  (as  a result 
of  the  Beta  ratio  criterion).  Thus,  almost  all  of  the  power 
is  dissipated  in  the  load  transistor.  Power  is  then 

P < , or  (2-11) 

P = ko-pi-  , (2-12) 

•-1 

where  is  a constant  coefficient  which  includes  a factor 
for  the  duty  cycle  (percent  on  time)  of  the  device. 

The  above  equation  for  the  static  power  dissipation  is 
in  fact  the  total  power  dissipated  by  the  device.  The 
output  load  on  the  inverter  is  assumed  to  be  purely 
capacitive.  Therefore  during  any  complete  0 to  1 to  0 (or  1 


to  0 to  1)  cycle,  the  total  energy  supplied  to  the  capacitor 
is  zero.  Effectively,  the  capacitor  'pay  be  removed  for  a 
power  consumption  model.  Then  the  logic  gate  in  the  model 
switches  in  zero  time  and  the  power  is  as  in  eauation  2-12, 

2.1.3  Inverter  Area  - 

The  area  of  an  MOS/LSI  integrated  circuit  is  a useful 
cost-related  measure. 

For  a single  MOS  inverter,  only  part  of  the  area  is 
related  to  the  channel  widths  and  lengths  of  the  two 
transistors.  As  shown  in  Fig.  5,  a significant  portion  of 
the  area  of  an  IC  is  occupied  by  power  supply  lines  and  by 
signal  lines.  These  area  components  are  a function  of  the 
logic  equations  being  implemented  and  of  the  skill  of  the 
layout  designer.  For  this  paper,  the  logic  equations  are 
assumed  to  be  in  their  final  form,  hopefully  near  optimum. 
Also,  the  assumption  is  made  that  the  layout  is  well  done, 
with  very  little  wasted  space.  Changes  in  the  area  of 
certain  portions  of  the  IC  are  assumed  not  to  affect  other 
areas  to  any  great  extent.  In  other  words,  the  layout  is 
considered  to  be  plastic  such  that  changes  or  alterations  in 
certain  places  do  not  cause  inefficiencies  or  wasted  space 
elsewhere . 


t. 
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With  these  assumptions  in  mind,  the  area  can  be 
considered  as  the  sum  of  the  area  directly  affected  by  the 
channels  and  of  the  remainder,  or  overhead.  This  assumption 
results  in  the  formula 


A = ki,  + I<5WiLi  + 


(2-13) 


where  kj|  is  the  overhead  area,  ‘and  and  kf^  are  constant 
coefficients  of  the  two  channel  areas. 


The  characterization  of  a single  mqs  inverter  is 
summarized  in  Table  1.  Here,  the  speed,  power,  and  area  of 
the  device  are  represented  as  functions  of  the  four  channel 
dimensions  - W^,  W^j,  L^,  and  L^. 


2.2  Characterization  Of  Complex  Gates 


The  parametric  eo.uations  presented  to  model  a single 
MOS  inverter  are  also  valid  for  more  complex  pates,  with 
some  further  assumptions.  After  all,  a complex  gate  is  only 
an  inverter  with  additional  driver  transistors  in  series  or 
in  parallel. 


For  example,  the  speed  of  a complex  pate  may  be  made 
comparable  to  that  of  an  inverter  by  appropriately  nicking 
the  channel  widths  and  lengths.  In  general,  the  channel 
widths  and  lengths  may  he  chosen  to  create  faster  or  slower 
rise  and  fall  times  independently.  Whatever  decision  is 
made,  however,  the  speed  of  the  complex  gates  can  be  easily 


Table  1. 

Characterization  of  MOC  Inverter 


Function 


Equation 
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related  to  the  speed  of  the  single  inverter.  For  example, 
the  doubling  of  the  speed  of  the  inverter  by  changing 

dimensions  could  be  matched  by  changing  the  dimensions  of 
the  complex  gates  by  a proportional  amount.  This  linear 

relationship  results  from  dependence  on  the  same  operating 
equations  by  all  logic  gates. 

In  the  case  of  power  dissipation,  an  even  simpler 

relation  exists.  The  power  is  dependent  only  on  the  load 

transistor,  and  all  logic  gates,  regardless  of  complexity, 

have  only  one  load  transistor.  Therefore,  the  power 

dissipated  in  a given  complex  pate  is  the  same  as  that  of  ; 

i 

the  standard  inverter,  with  the  same  load  transistor  channel 
dimensions. 

* 

I 

The  area  of  complex  gates  is,  in  general,  greater  than 
that  of  an  inverter.  In  effect,  the  area  of  a complex  gate 
is  composed  of  the  same  components  as  in  Table  1 . ><t(  j 

j 

remains  the  "overhead”  area,  which  may  be  increased  due  to  | 

the  additional  interconnection  needed  for  the  additional  j 

I 

inputs.  The  area  of  the  driver  transistors  is  a multiple  of 
the  driver  transistor  area  for  an  inverter,  with  a possible  i 

factor  for  differences  in  channel  dimensions.  As  before, 
the  area  of  complex  gates  is  also  linearly  related  to  the 
area  of  an  inverter. 

The  linearity  of  the  relationships  between  all  logic 
gates  yields  a useful  result.  A percentage  change  in  any  of 
the  four  parameters  - W^,  W^,  L^,  - applied  universally 
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to  all  Rates  on  a given  IC  would  result  in  new  speed,  oower, 
and  area  specifications.  In  fact,  if  the  coefficients  in 
Table  1 are  altered  to  model  the  average  logic  gate  on  an 
IC,  then  they  would  describe  the  operation  of  the  entire  IC. 
For  instance,  a W%  increase  in  the  of  every  driver 

transistor  on  the  IC  would  alter  the  specifications  of  the 
IC  as  predicted  by  the  Table  1 equations,  "modified  to 
describe  the  average  logic  gate.  , L-|  , and  now 

each  describe  the  average  channel  dimensions  on  the  IC.  In 
the  speed  equation,  k-|  remains  the  same,  while  kp  incJudes  a 
factor  related  to  the  average  number  of  average  drivers  in 
series,  i.e.,  the  slowest  path.  In  the  power  equation, 
now  includes  a factor  for  the  average  number  of  load  devices 

in  the  ON  state.  In  the  area  equation,  kij  adds  all  of  the 

interconnect  and  periphery  area  - output  bonding  pads  and 
off-chip  drivers.  This  area  is  related  to  environmental  and 
processing  parameters  which  are  unavailable  to  the  designer, 
k^  is  unchanged  since  there  is  exactly  one  load  device  on 
each  logic  gate,  and  kg  includes  a factor  for  the  average 

number  of  average  driver  transistors  in  the  average  logic 
gate . 


2.3  Storage  Registers 


This  section  deals  with  the  insertion  of  stcrape 
elements  or  registers.  Up  to  this  point,  the  discussion  has 
been  restricted  to  combinational  logic.  P"irst,  a register 
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model  is  developed.  Then  this  model  is  used  to  extend  the 
parametric  equations  to  sequential  machines  and  to  more 
general  multi-stage  pipeline  machines. 

2.3.1  Register  Model  - 

In  MOS,  many  implementations  of  storage  elements  exist, 
such  as  two-phase,  four-phase,  ratio,  and  ratio-less . ^ ^ 
Rather  than  try  to  model  the  specific  differences  amonp  all 
of  these,  a generalized  register  model  is  used.  Given  any 
one  regjister  type,  its  delay  or  speed  is  assumed  to  be  a 
constant.  Its  area  is  also  a constant  which  takes  into 
account  the  average  number  of  bits  in  a register  and  the 
area  per  storage  element.  Likewise,  the  power  dissipated  in 
each  register  is  modeled  as  a constant,  dependent  on  the 
register  type  and  the  average  bits  per  register. 


2.3.2  Pipelining  - 

Pipelining  is  a style  of  logic  architecture  in  which 
computation  throughput  is  increased  by  allowing  overlap  in 
the  processing,  of  successive  tasks.  The  logic  is  divided 
into  stages,  and  the  degree  of  pipelining,  m,  is  equal  to 
the  number  of  stages.  Each  stage  consists  of  some  amount  of 
logic  followed  by  a register  to  buffer  th'^  stame  outnut,  as 
in  Fig.  7.  The  logic  in  each  stage  may  be  of  arbitrary 
complexity.  The  registers  are  generally  sated  or  clocked  at 


^ lOr-  V*-  -. 


.Jit.. 


regular  intervals  v/hicb  are  greater  than  the  maximurr,  of  the 
delays  of  the  stapes.  In  a pipelined  implerrentation , new 
input  operands  may  be  presented  at  each  register  clock  time 
and  outputs  appear  at  each  repi.ster  clock  tire.  The  total 
time  from  input  to  output  is  generally  greater  than  in  the 
nonpipelined  case,  but  the  sy.ster  throughnut  may  be  rreatly 
increased . 


2. 3. 2.1  Degree  0 Pipeline 


A pipeline  of  degree  0 has,  effectively,  0 stages.  The 
pipeline  of  degree  0 is  taken  to  represent  an  MOS  circuit 
which  is  entirely  combinational  v/ith  no  re^ist^rs.  P'or  this 
case,  the  parametric  equations  in  Table  1 are  sufficient. 


2. 3. 2. 2 Degree  1 Pipeline  - 


The  degree  1 pipeline  consists  of  a single  (arbitrarily 
complicated)  stage  of  combinational  logic  followed  by  a 
register.  The  pipeline  of  deeree  1 is  analogous  to  a 
sequential  or  finite- state  machine.  In  this  case,  the 
equations  become: 


Speed : 


Com'binat  ional  delay  + Peaist^r  delay  (2-1't) 


L,  1.. 

k 1 — i-  + I<  T 

' W, 


(2-15) 


■ 


Power : 


Area : 


P = k 


VJt 

o — — + k o 

3 Li  3 


A = ki^  + ksWiLi  + + ;<9 


, and  (2-16) 


(2-17) 


where  ky  is  the  delay  of  the  register  in  use,  kp  is  the 
power  dissipation  in  the  register,  and  kg  is  the  register 


area . 


2. 3. 2. 3 Degree  M Pipeline  - 


In  general,  a pipeline  may  consist  of  any  number  of 
stages.  Also,  the  degree  of  pipelining  of  a given  circuit 
may  be  increased  by  subdivision  in  many  cases. For  a 
finite-state  machine  (degree  1)  such  ns  Fig.  8a,  where  the 
combinational  logic  consists  of  several  levels  of  logic,  the 
combinational  section  may  be  separated  at  aporoximatel y the 
mid-point  of  delay,  and  a register  inserted  as  in  Fig.  8b. 
The  total  delay  through  the  machine  is  increased  bv  one 
register  delay,  but  the  throughput  is  definitely  increased 
because  new  input  operands  may  be  introduced  after  the 
previous  inputs  have  passed  through  approximately  one-half 
the  combinational  delay  instead  of  the  entire  combinational 
delay  as  before. 


In  actuality,  the  subdivision  orocess  need  not  be  done 
in  two  parts.  Any  number  of  registers  may  be  inserted,  up 
to  a maximum  of  one  after  each  level  of  logic.  In  these 
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cases,  the  optimum  clock  interval  is  the  maxi>^um  of  the 
delays  between  the  registers.  In  general,  the  system 
performance  increases  asymptotically  toward  a maximum  value 
as  the  degree  of  pipelining  increases.  Simultaneously,  the 
system  cost  increases  in  approximately  linear  fashion  with 
an  increase  in  pipeline  degree.'^ 

A finite-state  machine  with  feedback  is  depicted  in 
Fig.  9a.  Here,  pipelining  by  subdivision  may  be 
accomplished  in  the  same  manner  as  before.  The 
combinational  logic  is  cut  at  the  half  delay  point  and  a 
register  is  inserted  as  in  Fig.  9b.  However,  in  this 
instance  new  operands  may  not  necessarily  be  introduced  at 
each  clock  interval.  The  circuit  has  a delay  or  time  length 
of  two  clock  intervals.  Thus  two  separate  tasks  may  be 
resident  in  the  machine  at  any  given  time,  with  one  task 
occupying  each  half  of  the  circuit.  New  operands  for  each 
task  are  presented  every  other  clock  interval.  As  before, 
the  finite  state  machine  mav  be  subdivided  into  m stages. 
In  this  case,  new  input  operands  for  each  task  are  presented 
once  every  m clock  intervals. 

The  parametric  eouations  for  the  m stage  pipeline  are 
slightly  different  from  before.  The  speed  of  a pipelined 
circuit  is  defined  as  the  effective  throughput  delav(e.R., 
the  time  between  tasks  rather  than  the  delay  of  a single 
task).  So,  the  time  is  set  equal  to  the  combinational  delay 
of  the  non-pipelined  case  plus  m times  the  register  delay. 
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all  divided  by  m. 


(2-19) 


where  ky  is  the  averap.e  delay  of  the  registers  between 
pipeline  stages.  Thus  pipelining,  by  subdivision  into  m 
stages  reduces  the  combinational  delay  by  a factor  of  m and 
adds  one  register  delay(  over  the  0 depree  pipeline). 


The  power  dissipation  for  a pipeline  is 


P 


W-, 

k^  + kqm  , 

^ ti 


(2-20) 


where  k«  is  the  average  register  power  dissipation, 
c 

The  area  of  the  pipeline  circuit  is  now 

A = k|j  + k^WjLj^  + k^W^Lj^  + k^m  , (2-21  ) 

where  kg  is  the  average  register  area. 


The  parametric  eauations  for  the  different  classes  of 
circuits  are  summarized  in  Table  2. 


Table  2 


Characterization  of  MOS  Circuits 


Function 


Deeree  of  Pipelining 


+ kpn’ 


♦ knm 


3.  OPTIMUM  CIRCUIT  SPEED,  POWER,  AMD  AREA 


> 


Given  the  rnodel  and  parametric  eauations  developed  in 
Chapter  Two,  it  is  now  possible  to  investigate  the 
dependence  of  MOS/LSI  circuit  speed,  power,  and  area  as 
functions  of  the  channel  dimensions  and  of  the  depree  of 
pipelining.  The  conditions  for  optimizing  each  of  the 
circuit  functions  individually  is  explored  first.  Then  the 
effects  of  Beta  ratio  are  examined. 


3. 1 Circuit  Area 


The  area  of  an  MOS/LSI  IC  is  directly  related  to  its 
cost.  Fairchild  Semiconductor  has  presented  a qualitative 
estimate  of  the  cost  per  unit  area  for  MOS  devices(Fip. 
10).  This  curve  may  be  further  approximated  by  a piecewise 
linear  fit,  also  in  Fig.  10.  The  left-hand  segment  has  a 
slope  near  zero,  so  cost  and  area  are  related  by  a constant 
coefficient.  Most  MOS  design  occurs  in  this  region,  so  this 


paper  will  simply  be  concerned  with  optirrurr  area.  P'or 


larger  ICs,  the  right-hand  portion  of  the  curve  suggests  a 


more  complex  relationship  between  cost  and  area  and  is 


beyond  the  scope  of  this  paper.  Put  in  either  case 


objective  of  minimum  cost  implies  minimum  area 


Prom  Table 


Obviously,  the  minimum  area  is  achieved  with  minima  for 


all  four  channel  dimensions  and  for  the 


pipelining,  m 


The  environmental  and  processing  parameters 


dictate  the  minimum  values  for  the  channel  widths  and 


In  the  case  of  m,  a value  of  one  would  be  used  for 


lengths 


machines  which  include  storage  elements,  while  m equal  to 


zero  applies  to  purely  combinational  logic 


3.2  Circuit  Power 


The  power  dissipated  on  an  IC,  from  Table  2,  is 


The  minimum  power  dissipation  result 


from  a mini'*'um 


for  W-i  , a '"aximum  for  L-,  , and  a minimum  for  m.  W,  and 

L-i  are  set  by  the  environmental  and  processing 

^max 

parameters,  which  are  dictated  to  the  designer.  The  value 


of  m is  either  zero  or  one  depending  on  the  circuit  in 


question,  as  in  the  case  of  minimurr  area. 


3.3  Circuit  Speed 


Also  from  Table  2,  the  speed  of  an  IC  is  piven  by  the 
f ormula 


T 


■1 


+ k 


2 W, 


+ k- 


(3-3) 


The  minimum  time  of  the  circuit  is  obtained  with  minima 
for  Lj  and  L^,  and  maxima  for  Wj,  W^,  and  m.  As  before,  the 
bounds  on  the  channel  dimensions  are  pre-determined  by 
processing  and  environmental  parameters.  For  m,  the  maximum 
value  is  a function  of  the  original  logic.  First,  the 
circuit  must  be  in  a form  which  is  pinel ineab] e . The 
simplest  pipelineable  circuit  is  one  that  is  entirely  serial 
in  nature.  Tasks  flow  from  one  area  of  logic  to  the  next 
with  no  feedback  from  a given  area  to  a previous  one.  To 
pipeline  such  a serial  circuit,  registers  may  be  inserted  in 
the  combinational  areas  between  levels  of  logic.  The 
maximum  number  of  registers  to  be  inserted  is  limited  to  the 
maximum  number  of  logic  levels  between  two  successive 
registers  in  the  original  logic  times  the  number  of 
combinational  sections.  The  time  delay  eouation,  3-3,  is 
valid  only  in  the  region  m=1  to  m eoual  to  the  upper  bound 
just  mentioned. 


Many  circuits  which  are  not  strictly  serial  are  still 
pipeline able . Finite-state  machines  may  be  pipelined  in 
much  the  same  fashion  as  serial  machines,  as  described  in 
Chapter  Two.  Circuits  which  are  combinations  of  several 
serial  or  finite-state  machines  are  also  sometimes 
pipelineable.  For  example,  each  section  of  a circuit  which 
is  a recognizable  serial  or  finite-state  machine  f"ay  be 
pipelined  independent  of  the  rest  of  the  circuit.  Selective 
use  of  this  technique  could  improve  circuit  throuphnut  when 
applied  to  the  "bottlenecks”  or  slowest  sections.  However, 
the  designer  must  take  care  that  the  time  dependencies 
between  the  sections  of  the  circuit  are  maintained. 

3.^  Peta  Ratio  Considerations 

Each  of  the  parametric  equations  - circuit  speed, 
circuit  area,  and  circuit  power  dissipation  - have  been 
optimized  as  functions  of  the  four  channel  dimensions  and 
the  degree  of  pipelining.  However,  it  is  common  industry 
practice  to  select  both.  and  to  take  on  their  minimum 

values,  ^^min>  respectively.  For  the  discussion 

of  more  complex  optimality  criteria  in  Chanter  4,  and 
will  be  considered  as  constants. 

•leveral  observations  may  be  made  concerninp  this 
choice.  First,  from  Chapter  1,  the  steady-state 
requirements  of  a logic  gate  create  a minimum  accentable 
Beta  ratio.  Recall  that 


Beta 


In  general  practice,  the  miniTurr  required  value  of  Seta  is 
greater  than  that  value  achieved  using  the  minima  for  the 
four  channel  dimensions  which  are  dictated  by  the  processinp 
and  environmental  parameters: 


Eeta^in  ^ 


^min 


Wt  L 
■'■min 


^min 


(3-5) 


In  order  to  achieve  the  necessary  Beta,  either  or  or 
both  must  be  increased  above  its  minimum,  even  if  the 
minimum  values  of  and  are  chosen.  Of  course,  if  or 
is  greater  than  its  minimum,  or  must  be  increased 
even  further. 

From  Table  2,  the  equation  for  circuit  power  varies 
directly  with  and  inversely  with  Here,  an  increase 

in  Wj  above  its  minimum  would  require  a proportionate 
increase  in  either  Lj^  or  so  the  power  could  at  best 

remain  the  same,  or  at  worst  increase  with  the  increase  in 

Wi. 

From  the  area  equation  in  Table  2,  increases  in  either 
or  Ljj  would  increase  total  area.  The  accompanying 
increase  in  or  required  by  Eetaj^j^p  would  further 


increase  area. 


So,  for  both  power  and  area,  it  is  obvious  that  the 
optimum  choices  for  and  are  their  minima. 

In  the  equation  for  soeed  in  Table  2,  both  V/]  and 
appear  in  denominators,  with  and  in  numerators.  ftn 

increase  in  requires  a proportionate  increase  in  the 
product  of  Lji  and  At  best,  this  would  leave  speed 

unchanged,  and  at  worst  increase  L-|  and  by  the  same 

amount,  thus  increasing  the  delay  ti^ne. 

Concerning  W^,  an  increase  above  its  mini'^um  would 

require  an  increase  in  the  product  of  L-|^  and  V^.  But  here, 
if  the  increases  were  only  applied  to  W-|^  and  to  the 

minimum  Beta  bound  would  be  satisfied  and  the  delay  time 
would  decrease.  In  the  extreme,  this  implies  that  it  is 

possible  to  grow  and  without  bound  in  order  to  reduce 
the  delay  time.  However,  as  a cbannel  width  increases 

beyond  a certain  limit,  the  simplified  MOB  model  fails  and 
the  delay  time  actually  begins  to  increase  with  increasing 

channel  width. 

In  effect , the  ki  term  in  the  speed  equation  is  the 

only  one  which  does  not  categorically  support  the  choice  of 

Wi  and  Lj  . For  a particular  case,  with  assigned 
^min  '^min 

values  for  the  kj*s,  the  simplification,  fixing  and  to 
their  minimum  values,  should  be  reexamined. 


Choosinp  Wi  and  f’roatly  siTpHfic?  the 

■'•min  ‘>in 

parametric  equations.  Let 


x?’ 

X3’ 

X?' 

X6' 


^min 


X?^d  . 
-Vin 


K ^ w ^ 


min 


X5“l 


min 


X6^d„, 


min 


, (3-M 
, (3-7) 
, (3-8) 
, and  (3-9) 


(3-10) 


With  these  substitutions,  the  equations  appear  as  in  Table 


45 


4.  OPTIMA  IMVOLVir-'G  TVQ  CRITF.f!IA 


Although  speed,  power,  and  area  are  each  of  interest  to 
the  circuit  designer,  it  is  usually  some  combination  of 
these  that  is  the  objective  to  be  achieved.  In  this 
chapter,  five  objective  functions  are  examined  for  the  case 
of  m>0,  i.e.,  sequential  machines.  These  include  the  three 
pairwise  products  of  the  parametric  equations  - speed-power, 
power-area,  and  speed-area.  Also,  both  minimum  area  and 
minimum  power  are  derived  assuming  a required  circuit  speed. 
In  addition,  optima  for  the  pairwise  products  are  derived 
for  m=0,  or  purely  combinational  logic. 

4.1  Speed  And  Power  Optima 

4.1.1  Speed-Power  Product  - 

The  speed-power  product  is  a function  mentioned  often, 
especially  when  comparing  various  IC  processes  (TTL,  FCL, 
MOS,  etc.).  As  mentioned  b'='fore,  the  speed  eouation 
actually  represents  a delay  time.  Although  some  references 


i; 

1 

1 


I 

t ’ 
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are  made  in  the  literature  to  the  power-delay  product  this 
paper  uses  the  term  speed-power  product,  which  is  more 
common.  Also,  pipelining,  a factor  not  usually  considered, 
is  included  in  this  analysis. 

Multiplication  of  the  power  and  speed  equations  gives 


PT 


K3' 

( TT"  '<8'"  ) ^ 


k^'Li  + 


■ + ky  ) 


'k3L^  + kykg 


m 


Li  m 


Wd 


(4-1) 

(4-2) 


From  the  form  of  equation  4-2  several  observations  can 
be  made.  First,  the  speed-power  product  is  inversely 
related  to  so  minimum  PT  implies  maximum  V/^. 


Optimum  values  for  m and  may  be  obtained  by  taking 
partial  derivatives: 


^ PT 
fc  m 


kyks  - 


k2 


, and  (4-3) 


PT 
^ L 1 


k-|  ’kg  - 


^2  ' *^3  ' k^  ' ky 


(4-4) 


These  two  partial  derivatives,  when  set  equal  to  zero, 
provide  two  simultaneous  equations  with  two  unknowns. 
However,  substitution  of  one  equation  into  the  other  results 
in  a fifth  order  equation,  for  which  no  algebraic  solution 
has  yet  been  found. 
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It  is  still  possible  to  derive  optimum  values  for  rn  and 
L]^,  assuming  that  one  of  them  is  fixed.  Setting  equation 
4-4  equal  to  zero  and  solving  for  the  positive  value  of 
yields 


^opt 

. kp  'k:> ' + k^  ' 

(4-5) 

\ 

k^  'kgk'^m 

Solving  equation 

4-3 

in  a similar  manner  gives 

-if 

'•<2  Vk^  ' + k 1 ' kp  ' 

(4-6) 

'"opt 

1 

l<7kpl-.'^Li 

These  two  values  are  graphed  in  Fig.  11. 


k more  reasonable  method  of  minimizing  the  speed-power 
product  is  one  of  the  many  known  methods  of  iteration. 
Either  equation  4-5  or  4-6  might  be  used,  in  conjunction 
with  4-2.  Such  an  approach  requires  values  for  the  constant 


coefficients  (k^'s).  Since  these  values  are  unavailable  for 


this  paper,  it  is  only  possible  to  make  comments  on  the  form 
and  shape  of  the  speed-power  product. 


Since  , Lj,  and  m are  all  physical  quantities,  it  is 
only  necessary  to  examine  positive  values  for  them.  The 
general  shape  of  PT  with  respect  to  ¥^,  L^,  and  m, 
individually,  is  depicted  in  Fig.  12.  As  mentioned  before, 
the  optimum  value  for  is  a maximum,  independent  of  the 


values  of  and  m.  The  labels  and  Lj  ^ are  the 


values  derived  in  equations  4-5  and  4-6, 


Optimum  values  for  and  m,  however,  are  hiphly 
interdependent.  In  Fig.  13,  the  shape  of  PT  is  shown  as  a 
function  of  both  Lj  and  m.  Since  the  shape  is  a 
hyperboloid,  it  is  straightforward  to  iterate  to  the  minimum 
value  for  PT  once  the  coefficients  are  evaluated. 

The  shape  of  Fig.  13  suggests  an  optimum  that  takes 
into  account  both  inherent  device  speed  (through  ) and 
speed  enhancement  with  pipelining  (through  m).  However, 
this  optimum  may  be  unattainable  for  several  reasons.  For 
L^,  the  optimum  may  require  a value  too  large  or  too  small 
with  respect  to  processing  or  environmental  restrictions. 
Likewise,  the  optimum  value  of  m may  be  less  than  one  or 
greater  than  the  maximum  possible  degree  of  pipelining  for  a 
particular  circuit.  In  any  of  these  cases,  it  is  only 
possible  to  choose  the  permissible  values  of  and  m which 
yield  a figure  for  PT  as  near  to  the  minimum  as  possible. 


4.1.2  Minimum  Power  For  Specified  Speed  - 

Often,  a circuit  designer  has  a predetermined  design 
goal  for  the  speed  of  a circuit.  This  then  becomes  a 
constraint  with  respect  to  other  design  objectives.  The 
particular  objective  function  examined  in  this  section  is 
the  optimization  of  power  dissipation,  assuming  a bound  on 


the  circuit  speed. 


A 


From  Table  3f  the  speed  of  a circuit  is  given  by  the 


f ormula 


V, 


+ !<• 


(^-7) 


Now,  assume  that  the  circuit  in  question  must  meet  the 
requirement 


T < t. 


(4-8) 


It  is  now  possible  to  relate  the  degree  of  pipelining,  m,  to 
Li,  , and  tQ  by  setting  equation  4-7  eaual  to  tg  and 
solving  for  m: 


+ -y- 


tg  - ky 


(4-9) 


This  value  for  m is  then  substituted  into  the  equation 
for  P from  Table  3: 


xr4  + ir 


Li  0 tg  - ky 


(4-10) 


A maximum  for  leads  to  a minimum  for  this  equation.  This 
equation  may  be  further  optimized  by  taking  its  partial 
derivative  with  respect  to 


b P _ ''^8 

i L 1 t ri  *■  k r, 


(4-11) 


0 ■ *'7  t.- 


Setting  this  equation  equal  to  zero  and  solving  for 
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which  must  be  positive,  yields  the  result 


kl'kg 


This  equation  represents  the  optimum  choice  of  Lj  in 
order  to  obtain  minimum  power  dissipation,  for  a specified 
bound  on  speed,  tQ.  This  value  for  may  then  be 

substituted  into  equation  '1-7,  Riving 


"opt 


• (4-13) 


Equations  4-12  and  4-13  now  provide  the  optimum  values  for 
and  m as  functions  of  a speed  bound,  tg,  as  shown  in  Fig. 
14.  The  optimum  increases  monoton ica lly  proportional  to 
the  square  root  of  tQ-ky.  In  other  words,  as  the  speed 
bound  is  relaxed  (larger  tQ  implies  a slower  requirement), 
is  larger  in  order  to  achieve  the  minimum  power. 
Likewise,  m decreases  monotonically  with,  increasing  tQ 
(eventually  proportional  to  the  inverse  of  the  square  root 
of  tQ),  because  the  degree  of  pipelining  required  to  achieve 
a slower  speed  is  less. 


In  the  case  of  minimum  power  with  a speed  bound,  it  is 
possible  to  calculate  the  optimum  values  for  Lj,  , and  m 
algebraically.  Iterative  techniques  are  not  necessary. 
These  optimum  values  may  then  be  substituted  into  the 
equation  for  power  in  Table  3 to  provide  the  minimum  power 
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for  the  specified  tg. 


It  is  interesting  to  examine  the  case  where  the 
designer  is  restricted  to  a degree  one  pipeline,  or  tr=1. 
This  represents  a simple  finite  state  machine  approach. 
From  Fig.  14b,  it  is  obvious  that  for  tp  greater  than  or 
equal  to  x,  the  optimum  choice  for  m is  indeed  one.  Put  for 
any  choice  of  tQ  less  than  x,  m>1  is  optimum.  So  in  this 
range  , a restriction  of  m=1  will  yield  a value  for  power 
which  is  greater  than  the  best  obtainable. 


4.2  Area-Power  Product 


Multiplication  of  the  area  and  power  equations  yields 
the  formula 

AP  = ( !<4  + k5'Li  + + kgm  ) (4-14) 

- “3  XS  - n ^ 

+ kilkgm  + kg'kgL^m  + kg'kgW^^m  + kgkgm^  . (4-15) 


A minimum  for  this  equation  requires  minima  for  both 
and  m,  as  they  appear  only  in  numerators.  To  obtain  the 

proper  value  for  Lj,  consider 

V »p  ko’kii  + ko'kc'W.  + ko'kom 

= kg’kgm-—^ — il 3 — 6 — d 3 — ^ (4-16) 


Setting  equation  4-16  to  zero  and  solving  for  which  must 
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be  positive,  yields  the  formula 


L 


1 


1^3 '1'4  + 


i<5'kgm 


(4-17) 


The  variation  of  the  area-power  product  with  respect  to 
each  of  the  three  variables  is  demonstrated  in  Fig.  15.  As 
mentioned  before,  the  optimum  values  for  , and  m may 
not  be  physically  exactly  realizeable.  Interestingly,  the 
restriction  of  m=1  yields  the  optimum  value  of  AP,  as  shown 
in  Fig.  15c.  An  increase  in  the  degree  of  pipelining  is 
useful  to  increase  speed,  which  is  not  considered  in  this 
section . 


4.3  Speed  And  Area  Optima 
4.3.1  Speed-Area  Product  - 

The  product  of  the  speed  and  area  equations  is  a design 
criterion  of  great  interest.  Often,  the  speed  of  a system 
is  used  to  denote  its  per f ormance . Also,  in  this  case,  area 
is  related  to  cost.  So,  speed-area  is  actually  a measure  of 
a circuit’s  performance-cost  ratio,  since  speed  is 
characterized  by  its  inverse,  time.  A minimum  speed-area 
product  is  related  to  minimizing  both  time  and  cost, 
providing  the  maximum  performance-  cost  ratio. 


Figure  15.  Variation  of  Area-Power  Product  with  Respect 
to  L j W , and  m,  Individually 


I 


Multiplication  of  the  speed  and  area  equations  from 
Table  3 gives  the  equation 


rn  Wjm 


)(  kij  + 1<5’Li  + kg'W^  + kqm  ) (^1-18) 


ki|ky  + ( k-i'kg  + k^'ky  )L;|^  + ><6''<yW^  + kYkqm 

* "~W7~  m 


(^-19) 


In  eauaticn  4-19,  appears  only  in  the  numerator,  so 
minimum  TA  implies  minimum  L^.  Both  and  m appear  in  a 
more  complex  fashion.  Their  optimum  values  might  he 
attained  by  taking  partial  derivatives: 


kp'kn  ko'ka  ko'kq'Li 

^e'^7  -^-T + U— - _2_L_i  , (1|_20) 


W^^m 


^'d^m 


kykg  - 


k^'kg’  k2'*<4  k^’k^’Li' 


Xrke’WdLi  k2’k5*Li 


(4-21  ) 


But,  like  optimum  speed-power  oroduct,  the  optimum 
speed-area  product  does  not  lend  itself  to  algebraic 
solution.  However,  optimum  values  for  and  m may  be 

obtained  if  one  of  them  is  considered  fixed,  following  the 
example  in  section  4.1.1,  When  values  for  the  various 


59 


constant  coefficients  are  available,  iteration  to  the 
minimum  value  of  TA  may  be  used. 

The  variation  of  TA  with  respect  to  each  cf  the  three 
parameters  individually  is  depicted  in  Fip.  1b.  Since 

and  m have  optimum  values  which  are  neither  minima  nor 
maxima,  the  optimum  choices  are  interdependent,  as 
illustrated  in  Fig. 17.  This  relationship  demonstrates  the 
tradeoff  betv/een  device  speed  (via  W^)  and  speed  enhancement 
through  pipelining  (via  r).  Of  particular  interest  is  the 
location  of  the  m=1  plane  in  Fig.  17,  which  specifies  the 
available  values  of  TA  without  pipelining.  Several 

possibilities  for  the  location  of  this  plane  exist. 

. If  the  m=1  plane  intersects  the  hyperboloid  at  its 
minimum  point,  then  the  optimum  performance-cost  ratio  may 
be  obtained  by  simply  choosing  the  aopropriate  value  of  W^. 
If  the  minimum  value  for  TA  occurs  at  m<1,  then  the  optimum 
performance-cost  ratio  is  unattainable,  and  m=1  should  be 

chosen.  Then  the  optimum  value  of  for  m=1  should  be 

selected . 

Finally,  if  the  minimum  value  for  TA  occurs  at  rr>1, 
then  an  optimum,  performance-cost  ratio  may  only  be  achieved 
with  the  aid  of  pipelining.  A choice  of  m=1  restricts  the 
value  to  the  minimum  point  on  the  TA  surface  which 
intersects  the  m=1  plane. 


Slope  = k7  kg 


(c) 


FP-S45J 


Figure  16. 


Variation  of  TA  with  Respect  to  L , 

f 

W , and  m,  Individually 
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4.3.2  Minimum  Area  For  Specified  Speed  - 

An  optimum  performance-cost  ratio,  as  presented  in  the 
previous  section,  is  not  always  desirable.  There  often 
exists  a bound  requirement  on  the  performance,  or  speed,  of 
the  circuit.  In  this  situation,  it  is  useful  to  calculate  a 
minimum  area,  or  cost,  as  a function  of  a speed  bound,  t^. 


In  section  4.1.2,  the  equation  for  speed  was  set  eoual 
to  tg.  Then,  this  equation  was  solved  for  m in  equation 
4-7: 


m 


xrh. 


(4-22) 


It  is  also  possible  to  solve  this  equation  for  W^. 

IV 

" m ( tg  - ky  ) - 'L^ 


(4-23) 


Mow,  the  dependence  of  and  m on  tg  is  graphed  in  Fig. 
18.  As  tg  increases  the  circuit  is  slower,  so  either  device 
speed  or  pipelining,  or  both,  may  be  reduced. 


To  obtain  a minimal  area,  either  equation  4-22  or  4-23 
may  be  substituted  into  the  equation  for  area  from  Table  3. 
Using  equation  4-22, 


taik 


I 
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k^  'Ll  + ,^ 


kn  + k5'Li  + kg'W^  + kg- 


to  “ k' 


(4-24) 


k2’ko 


k 1 ' kgL j 

k4  + !<5’Li  + k5'Wg  + to  -'ky  + ( to^-^’  ) 


. (4-25) 


As  before,  area  is  directly  related  to  Li  and  minimum  area 
implies  a minimum  for  Li* 


The  term  tQ  - k-^  always  appears  in  a denominator  so  A 
varies  inversely  with  the  speed  bound  Pq  (In  the  speed 
equation  in  Table  3,  Pq  is  always  greater  than  or  equal  to 
ky,  so  pQ  - ky  is  always  positive.).  On  the  other  hand, 


The  optimum  choice  for  may  be  obtained  by  partial 
differentiation : 


A 


^ W, 


»<6'  - 


k2'l<9 


( to  - k7  ) W, 


(4-26) 


Equation  4-26  is  set  equal  to  zero  and  solved  for  W^,  which 
must  be  positive,  to  yield 


W. 


’opt 


k2  'kg 


(4-27) 


( tn  - 1<7  ) 


Eouation  4-27  is  the  value  of  which  minimizes  A for  a 
speed  bound  of  Pq. 


appears  in 

both  a numerator 

and 

a denominator,  so  the 

; s 

relationship 

between 

A 

and 

is  hyperbolic.  These 

■ ■/ 
} 

•f 

dependencies 

are  shown 

in 

Fig . 

19. 

^ i' 

1 I 
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1 
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c^,  and  Assuming 
+ k'/lJ 


Figure  19.  Dependence  of  Area  on  L 


Slope  = ks  + 

y 

k4  + ks  Li 

Now,  equation  M-23  is  substituted  into  the  area 


equation ; 


!<5*Li  ^ 


m ( 


^0  ■ 


k y,  ) — k 1 ’ L 1 


. (M-28) 


Both  and  tg  exhibit  the  same  effects  on  area  as  in 

equation  4-25  and  in  Fig.  19.  The  variation  of  A with 
respect  to  m is  hyperbolic  and  is  depicted  in  Fig.  20.  The 
value  of  m which  produces  a minimum  for  equation  4-2R  is 
achieved  by  a partial  derivative: 


kg»k6'  ( tp  - ky  ) 

( m (tQ  - ky  ) - k^  'L^ 


(4-29) 


Setting  equation  4-29  equal  to  zero  and  solving  for  m 
provides  the  optimum  value  for  m: 


*^1  '^1  -l(  *-2'‘^6' 

tQ  - ky  |kg  ( tQ  - ky  ) 


. (4-30) 


From  equation  4-22,  the  restriction  of  speed  equal  to 
tg  requires  that  the  degree  of  pipelining  exceed 
k 'L j^/( tg-ky ) . Mote  that  when  m reaches  this  value, 
infinite  area  is  requir'jd  (see  Fig.  20),  corresponding  to 
infinite  W^.  If  this  value  is  greater  than  one,  then  the 
speed  bound  is  unobtainable  without  a higher  degree  of 
pipelining.  And,  as  before,  even  if  m=1  is  available  as  an 
option,  it  may  severely  increase  area  if  in  equation 
4-30  is  significantly  greater  than  one. 


I 
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Optima  For  Combinational  Logic 

The  set  of  equations  in  Table  3 for  m = 0 describe  the 
model  for  purely  combinational  logic.  In  this  section,  the 
pairwise  products  of  these  functions  are  optimized 
speed-power,  oower-area,  and  speed-area  under  an  m=0 
constraint. 

4.4.1  Optimum  Speed-Power  Product  - 

Multiplication  of  the  speed  and  power  equations  for  m=0 
in  Table  3 yields 


kp'ko' 

PT  = ki'k3'  . (4-31) 

As  is  obvious  from  this  equation,  PT  increases  linearly 
with  reciprocals  of  and  Therefore,  PT  is  minimized 

by  choosing  maxima  for  L-j^  and  Of  course,  environmental 

and  processing  parameters  place  upper  bounds  on  both  of 
these  channel  dimensions. 

4.4.2  Optimum  Power-Area  Product  - 

Equation  4-32  shows  the  power-area  product  for 
combinational  logic. 


1; 


It 


l<3'k5'  + 


(^-3?) 


AP  grows  linearly  with  anr"  'with  the  reciprocal  of  Lj.  In 

this  case  the  optimum  L|  is  a maximum  while  the  optimur. 


IS  a mimirr.UTp.. 


4.4.3  Optimum  Speod-Aren  Product  - 


The  speed-area  product  is  a measure  of  the  performance 
cost  ratio  of  the  circuit.  For  strictly  combinational 
logic,  this  product  is: 

TA  = l<2'kQ'  + ki'kiiLj  ^ !ci'k5’L3^  + ki'kg’LjW^^ 


kp'kji 


(4-33) 


Ti  increases  linearly  and  quadractically  with  L^,  so  a 
minimum  fer  is  desired.  with  resnect  to  , however, 
'ttelbher  a mifl-imuffl—nor -a -piax iouirr:  is  the  optimum.  As  before, 
the  optimum  is  achieved  by  taking  a partial  derivative  and 
solving  for  the  positive  root: 


ir^p'^4  ' ^5  '^1 


(4-34) 


■ V ' .t-' 


70 


This  value  for  minimizes  TA  for  purely  combinational 
logic . 

Table  4 provides  a summary  of  the  results  of  the 
optimizations  in  chapters  three  and  four. 


a 


Optimal  Values  of  , Lj,  ^ind  m 


Function 


Area 


Power 


Speed 


no  effect  min 


Speed-Power  intermediate  -A  max 

Power-Area  ] 1<3  ' min 


intermediate  -A 


l<5’!<gm 


Speed-Area  min 
Power(T  = to)  l|^3  ' ^ 

I i<rk8 


Area(T=tn)  min 


intermediate  -B  intermediate  -B 

max  + ^2* 

' >'d 


1<2''<9 


( tQ-ky ) 

kl'Li  ^-rviv 


^'0~*‘7  ll  '^9  ( t.Q-l<y  ) 


Area 


Power 


Speed 


no  effect 


Speed-Power  m.ax 
Power-Area  max 
Speed-Area  min 


l|l<2  "<4->-kp  M<[^ 'L-j 

f ■ 


* - Ho  closed  form,  numerical  solution  easily  obtained 
A - Solve  equations  (4-3)  = (4-4)  = 0 
B - Solve  equations  (4-20)  = (4-21)  = 0 
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5.  NUMERICAL  EXAMPLES 


In  order  to  illustrate  the  decisions  and  trade-offs  in 
j the  design  process  further,  two  numerical  examples  are 

' presented  in  this  chapter.  The  equations  used  are  those  in 

Table  3 for  m>0,  and  the  methods  of  ootimization  are 
detailed  in  Table  A. 


The  values  chosen  for  the  k^^’s  are  arbitrary,  since  the 
actual  values  depend  both  on  the  circuit  under  consideration 
and  on  proprietary  process  parameters.  It  should  be  noted 
that  the  particular  values  used  were  selected  to  illustrate 
certain  points. 

In  addition,  arbitrary  bounds  are  selected  for  the 
parameters.  L^  will  be  allowed  to  range  from  1 to  10 
inclusive,  and  from  0.1  to  5 inclusive.  Also,  the  degree 
of  pipelining,  m,  may  vary  between  1 and  5. 
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5 . 1 Example  One 

For  this  example,  the  kj^'s  are  assigned  values  as 
follows : 

k^'  = 4 ki,  =10  k^  = 1 1 

k;,'  = 1 k3'  = 1 kp  = 1 

k3’  = 2 k(^'  = 2 ko  = 9 . (5-1) 

Using  these  values  and  Table  3 and  Table  4,  optimum  values 
are  calculated  for  speed,  area,  and  time  as  well  as  their 
pairwise  products.  The  results  are  presented  in  Table  5. 

Each  row  of  the  table  presents  the  equation  which  is 
optimized,  the  corresponding  values  of  W^,  Li,  and  m,  and 
the  values  of  all  the  pertinent  equations.  The  optimum 
value  for  each  objective  is  circled  for  clarity. 

In  the  cases  of  optimum  speed-area (TA ) and 
speed-power (PT)  products,  the  desired  value  of  m is  less 
than  one.  Since  a pipeline  of  decree  one  represents  a 
finite-state  machine,  one  is  the  minimum  useable  value  for  m 
in  a finite-state  machine.  The  lov/er  section  of  Table  5 
shows  the  optimum^  TA  and  PT  for  m restricted  to  one.  This 
provides  the  minimum  obtainable  values  in  cases  where  the 
values  computed  are  unrealizable. 

The  entry  for  optimur'  speed(T)  'uith  m restricted  to  one 
shows  the  best  obtainable  without  pinelininc,  i.e.,  15.2. 

This  can  be  contrasted  with  the  optimum  with  pipelining, 
11.8.  The  maximum  value  for  m is  used  because  area  and 


Table  5. 


Numerical  Example  1 


Optimized 


inction  m 

^1 

A 

P 

T 

TA 

PT 

AP 

TA 

0.815 

0.84 

1.0 

20.0 

2.82 

17.4 

48.9 

56.3 

PT 

0.862 

5.0 

2.37 

30.1 

1.71 

22.2 

670.  1 

37. 9i 

51.4 

AP 

1 .0 

0.1 

6.2 

25.4 

1.32 

45.8 

1163. 

60.6 

!33.6| 

A 

1.0 

0.1 

1.0 

ijpUI 

3.0 

25.0 

505. 

75.0 

60.6 

P 

1 .0 

0.1 

10.0 

29.2 

1 .2' 

61.0 

1781 . 

73.2 

35.0 

T 

5.0 

5.0 

1.0 

66.0 

7.0 

iHTT  8i 

781. 

82.9 

462. 

m restricted  to  1 

TA 

1.0 

0.815 

1.0 

21 .6 

3.0 

16.2 

Unj 

48.7 

64.9 

PT 

1.0 

5.0 

2.37 

31.4 

1.84 

20.7 

649.  i 38.11 

57.8 

T 

1.0 

5.0 

1.0 

30.0 

3.0 

|15.2( 

456. 

45.6 

90.0 

--  t . 


power  are  not  considered.  For  this  example,  speed  is  the 
only  equation  whose  optimum  suffers  by  restriction  to 
non-pipelining. 

It  is  possible  to  solve  for  an  objective  which  takes 
into  account  all  three  functions:  speed,  power,  and  area. 

For  instance,  one  might  wish  to  minimize  the  speed-area 
product  under  the  constraint  that  power  must  not  exceed  some 
maximum  value,  P^ax*  power  equation  from  Table  3 is 

solved  for  one  parameter,  say  This  formula  is 

substituted  for  in  equation  'J-IQ.  The  resulting  formula 

for  TA  is  a function  of  variables  and  m and  the  constant 
Iterative  techniques  are  now  applicable,  as  in 
Chapter  M.  It  should  he  noted  that  this  procedure  is  only 
valid  for  those  values  of  which  are  less  than  that 

achieved  when  simply  minimizing  TA  v/ithout  regard  to  power, 
or  2.82  from  Table  5.  The  results  for  minimum  TA  v/ith  a 
power  bound  are  presented  in  Table  6.  As  the  maximum  power 
is  decreased  the  minimum  achievable  TA  increases. 


5.2  Example  Two 

This  example  is  presented  to  show  an  example  in  which 
pipelining  is  attractive.  The  k^'s  are  assigned  the 
following  values: 


Minimum  TA  for  Specified  Maximum  P 


Pmax  -T,  Li  A T TA  PT  AP 


2.75 

1.0 

0.804 

1.14 

21.8 

16.8 

366 . 

46.2 

59.8 

2.5 

1.0 

0.789 

1.33 

21.9 

17.6 

386. 

44.0 

54.8 

2.25 

1.0 

0.769 

1.6 

22.1 

18.7 

414. 

42.1 

49.8 

2,0 

1.0 

0.743 

2.0 

22.5 

20.3 

457. 

40.7 

45.0 

1.75 

1.0 

0.707 

2.67 

23.1 

23.1 

533. 

40.4 

40.4 

1.5 

1 .0 

0.653 

4.0 

24.3 

28.5 

693. 

42.8 

36.5 

1.25 

1.0 

0.56 

8.0 

28.1 

44.8 

1259. 

56.0 

35.2 

t’ 
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l<1  ' = 6 kij  = 10  ky  = 4 

k2'  = 2 1(5'  = ? kg  = 1 

k3*  = 2 kf^'  = 'I  1<Q  = 3 . (5-2) 

Using  the  equations  of  Table  3 and  the  methods  of  Table 
the  results  are  generated  and  are  shown  in  Table  7.  In  this 
example  all  functions  dealing  with  speed  (TA,  PT , and  T) 
benefit  from  pipelining. 

In  the  case  of  optimum  TA,  oipelining  provides  a 27 
percent  decrease  over  non-pipelining.  The  decrease  for  PT 
is  almost  6 percent,  while  T decreases  by  49  percent.  The 
trade-off  is  essentially  between  the  delay  in  combinational 
logic  and  the  area  and  power  required  by  additional 
registers.  For  speed,  T,  the  maximum  allowable  figure  for  m 
is  chosen  because  area  and  power  are  not  considered. 
Incidentally,  the  optimum  values  for  TA  and  PT  in  Table  6 
are  the  best  obtainable  with  integer  values  of  m.  The 
equations  yield  optimum  m values  of  3.29  and  1.78, 
respectively . 

As  with  example  one,  it  is  nossible  to  find  the  minimum 
speed-area  product(TA)  subject  to  an  upper  bound  on  power, 
Pj^gjj.  In  this  case  the  value  of  Pp,ax  '’’ust  stay  less  than  or 
equal  to  5.0,  from  Table  7.  The  minimum  TA  is  shown  in 
Table  8 for  various  values  of  before,  the  optimum 

TA  increases  as  the  power  bound  is  tightened  or  lowered. 
However,  minimum  TA  does  not  change  monotonically  because  L]. 
is  bounded  by  1 and  the  optima  using  P^ax  values  of  4.5  and 
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Table  7 


Numerical  Example  2 


Optimized 
Function  m W 


9 


ble  8. 

Specified  Maximum.  P 


5.0 

3.0 

0.764 

1.0 

ii.5 

3.0 

0.736 

1.33 

4.0 

2.0 

0.802 

1.0 

3.5 

2.0 

0.764 

1.33 

3.0 

1 .0 

0.866 

1 .0 

2.5 

1 .0 

0.808 

1.33 

2.0 

1.0 

0.729 

2.0 

1.5 

1.0 

0.612 

4.0 

^ i 

'i 

A 


A 

T 

TA 

PT 

AP 

24.1 

6.87 

165. 

34.4 

120. 

24.6 

7.57 

186. 

34.1 

111. 

21.2 

8.25 

175. 

33.0 

84.8 

21.7 

9.31 

202. 

32.6 

76.0 

18.5 

12.3 

227. 

36.9 

55.4 

18.9 

14.5 

274. 

36.2 

47.2 

19.9 

18.7 

373. 

37.5 

39.8 

23.4 

31.3 

733. 

46.9 

35.2 

■i  I 


I 


WimiBP'w 
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f 


3.5  occur  with  less  than  1.  In  effect,  increasing? 

above  the  optimum  choice  forces  m,  an  intefrer,  to  change 
value  which  in  turn  forces  TA  to  take  on  a higher  minimum 
value.  In  addition,  the  degree  of  pipelining,  m,  is  forced 
lower  by  the  decreasing  power  bound. 
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6.  COMCLIJS.IOM 


In  this  paper,  a simplified  model  was  devised  for  the 
logic  in  MOS/LSI  circuits.  First,  a single  MOS  inverter  was 
modeled,  and  its  speed,  power  dissipation,  and  area  were 
expressed  as  functions  of  transistor  channel  dimensions. 
This  model  was  then  scaled  to  represent  the  combinational 
logic  of  an  entire  circuit.  The  channel  dimensions  in  the 
equations  became  the  effective  or  average  dimensions  for  the 
circuit  as  a whole.  A register  model  was  introduced  and  the 
circuit  model  was  extended  to  reflect  the  effects  of 
pipelining.  Then,  the  speed,  power,  and  area  of  the  circuit 
were  formulated  using  the  parameters  of  the  model:  channel 
widths  and  lengths  and  the  degree  of  pipelining.  These 
equations  were  expressed  with  unevaluated  constant 
coefficients  which  deoend  on  proprietary  industrv 
information  and  on  the  specific  circuit  being  analyzed. 
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The  equations  describinp;  the  circuit  were  then 
individually  optimized  with  respect  to  the  oarameters. 
'Minimum  area  results  from  minimization  of  all  of  the 
parameteres.  Minimum  power  requires  maximum  load  channel 
length,  minimum  load  width,  and  minimum  pipelining  and  was 
independent  of  the  driver  channel  dimensions.  Optimum  speed 
(minimum  time)  requires  minimum  channel  lengths,  maximum 
channel  widths,  and  maximum  pipelining. 


After  these  observations,  the  model  was  further 
simplified  to  facilitate  the  study  of  more  complex  and  more 
interesting  optima.  In  this  model,  the  circuit  functions 
were  dependent  only  on  load  channe)  length,  driver  channel 
width,  and  degree  of  pipelining,  with  load  channel  width  and 
driver  channel  length  set  to  their  minimum  values. 


i ** 

I •’ 

i ' 

: ^ 


The  first  function  considered  was  the  product  of  speed 
and  power.  Although  a closed  form  solution  was  not 
obtainable  for  the  optimal  parameter  values,  each  optimum 
parameter  value  was  expressed  in  terms  of  the  other 
parameters.  These  equations  could  then  he  used  to  iterate 
to  the  minimum  for  the  speed-power  product.  This  procedure 
will  succeed  since  the  shape  of  PT  is  a hyperboloid.  In 
addition,  power  was  minimized  assuming  a bound  on  speed.  In 


this  case,  the  optimum  values  for  the  parameters  were 
derived . 
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Next,  the  product  of  area  and  power  was  considered. 
Here,  optimal  values  for  all  three  parameters  v;ere  attained. 

Optima  involving  speed  and  area  were  dealt  with  next. 
The  minimum  speed-area  product,  like  speed-power,  could  not 
be  solved  in  a closed  form.  However,  since  it  was  also  a 
hyperboloid,  iterative  technipurs  would  succeed.  Also,  an=a 
was  minimized  for  a bound  on  sreed,  and  optimal  values  were 
obtained  for  all  parameters. 

For  those  circuits  which  are  purely  combinational, 
optimal  parameters  values  for  single  and  pairv;ise  nroducts 
of  the  objective  functions  were  also  derived  and  tabulated. 

Finally,  two  numerical  examples  were  presented  to 
illustrate  the  methods  involved.  The  unevaluated 
coefficients  were  assigned  arhitrary  values. 

The  .models  and  circuit  functions  derived  are  useful  in 
describing  the  nature  of  the  various  trade-offs  and 
sacrifices  inherent  in  yOS/LSI  circuit  design.  With  them, 
it  can  be  shown  whether  a circuit  would  benefit  from 
particular  design  choices,  involving  various  channel 
dimensions  and  the  use  of  pipelining.  If  the  constant 
coefficients  are  evaluated  using  the  knowledge  of  a FO.S/LSI 
semiconductor  manufacturer,  then  near  optimum  design  choices 


may  be  made. 
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